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Expansion of the Actin Gene Family in Amphioxus 
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Abstract: Actins are a small family of ubiquitous proteins that are essential cytoskeletal components and are highly 
conserved during evolution. Actins are usually divided into two classes, the cytoplasmic and muscle actins, which have 
different functional roles. Here we systematically analyzed the actin genes in the genome of the primitive chordate 
amphioxus (Branchiostoma floridae). We found that amphioxus contains more than 30 actin genes, many of which are 
linked. Phylogenetic analysis suggests the amphioxus actin genes have clearly undergone extensive expansion through 
tandem duplications. The actin genes’ structure also varies a lot, containing 2 to 7 exons. We also cloned two muscle type 
of actin genes from the amphioxus (B. belcheri) and compared their expression patterns during early development. The 
slight difference in their expression suggests functional diversification of these actin genes. Our results shed light on the 
evolution both of actin genes themselves and their functional roles in chordate development. 
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Actins are a small family of ubiquitous proteins that 
are essential cytoskeletal components and are highly 
conserved during evolution. Actins are usually divided 
into two classes, the cytoplasmic and muscle actins, 
which have different functional roles (Pollard & Cooper, 


1986). Vertebrates have at least six actin isoforms 
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(Vandekerckhove & Weber 1984; Miwa et al, 1991), two 
cytoplasmic actins (f-cytoplasmic, y-cytoplasmic) and 
four muscle actins (a-skeletal, a-cardiac, a-vascular, and 
y-enteric). The cytoplasmic actin is widely expressed in 
many tissues while the muscle actins are specifically 


expressed in different types of muscles. In chordates, the 
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cytoplasmic and muscle actin are different at about 20 
amino acid positions, which serve to distinguish the two 
types of actins (Vandekerckhove & Weber 1979, 1984; 
Kovilur et al, 1993). Also, the cytoplasmic and muscle 
actin genes have different intron-exon structures in 
mammals. 

The actin genes in the primitive chordates, 
urochordates and cephalochordates, have been subjected 
to several studies (Bovenschulte & Weber, 1997; 
Kusakabe 1995, 1997; Kusakabe et al, 1997a, b, 1999; 
Suzuki & Satoh, 2000). The amphioxus, which is thought 
to be the closest living relative of the vertebrates, has 
been a model system to study the evolution and 
mechanisms of development in vertebrates (Holland et al, 
2004). The Florida amphioxus (Branchiostoma floridae) 
cytoplasmic and muscle actins are clearly distinguishable 
and closely related to vertebrate cytoplasmic and muscle 


1997b). The 


cytoplasmic actin gene BfCA1 is widely expressed while 


actins, respectively (Kusakabe et al, 


the muscle actin BfMA 1 is expressed strongly in somatic 
muscle and weakly in smooth muscle associated with gill 
slits. Also, southern blot analysis of genomic DNA has 
suggested that the amphioxus genome contains multiple 
muscle and cytoplasmic actin genes that might have 
arisen from recent duplication and gene conversion. 
Recent results indicate that vertebrate actin isoforms 
evolved after the separation of the vertebrates and 
urochordates (Kusakabe, 1995; Kusakabe et al, 1997a). It 
has also been suggested that the actin genes have arose 
independently in each of the three chordate lineages. 
With the nearly complete genome sequencing of the 
Florida amphioxus (B. floridae, Putnam et al, 2008), we 
can systematically study the actin genes in this primitive 
chordate species. We found that the amphioxus contains 
more that 30 actin genes, many of which are linked. 
Phylogenetic analysis suggests that the amphioxus actin 
genes have clearly undergone expansion through tandem 
and clusteral duplications. The comparative analyses of 
the actin genes’ structure suggest a model of the 
evolution of these genes. We also cloned two muscle 
type of actin genes from the amphioxus and compared 
their expression patterns during early development. The 


slight difference in their expression suggests functional 
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diversification of these actin genes. Our results shed light 
on the evolution both of actin genes themselves and their 


functional roles in chordate development. 


1 Materials and Methods 


1.1 Blast searches and phylogenetic analysis 

The coding region of Branchiostoma floridae 
muscle actin gene (B{MA1, GenBank accession number 
D87407) was used to blast the B. floridae genome 
(http://genome.jgi-psf.org/Brafll). The corresponding 
regions were extracted and the gene structures were 
(http://genes.mit.edu/ 


analyzed using Genomescan 


genomescan.html) and manual corrections. <A 
phylogenetic tree was constructed by the neighbor- 
joining method using the MEGA4 program (Zhao et al, 
2007). 
1.2 Cloning of Branchiostoma belcheri muscle actin 
genes and in situ hybridization 

Adult amphioxus (B. belcheri) were collected 
during the breeding season from the South China Sea and 
maintained in the laboratory. Naturally fertilized eggs 
were collected and cultured at room temperature. The 3’ 
partial sequences plus the 3' untranslated regions of two 
B. belcheri muscle type actin genes were cloned using 3' 
RACE. One corresponds to the published BfMA1 
(Kusakabe et al, 1999), and a second one corresponds to 
a new muscle actin gene (corresponding to the 23-9 gene, 
see below) whose corresponding genomic sequence is 
available in B. belcheri. The gene specific primers used 
were: 5'-ACATCCGTAAGGACCTGTACGCCAAC-3' 
(Bf{MA1) and 5'-GATCGTCAGCAGAAGGAGATCA 
CCTT-3' (23-9). Total mRNA of 36-hour larvae was 
used as template for the reverse transcription. The PCR 
products were then cloned into pGEM-T vector, 
sequenced and used to prepare probed for in situ 
hybridization. Jn situ hybridizations were carried out as 
described (Zhang & Mao, 2009). 


2 Results 
2.1 The actin genes in amphioxus genome 

Taking advantage of the nearly completed 
sequenced genome of the Florida amphioxus 


(Branchiostoma floridae), we searched systematically for 
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actin homologous genes. We made blast searches using 
the published B. floridae muscle actin (Kusakabe et al, 
1997b) coding region as the query. In total 11 scaffolds 
were found to contain actin homologous sequences. The 
corresponding regions were extracted and analyzed by 
Genomescan and manual corrections. Surprisingly, 37 
genes were predicted, 27 of which were full length and 
the rest contained gaps in the sequences (Fig.1, Tab. 1). 
Most of them (31 out of 37) are organized into 5 gene 
clusters (Fig.1). The genes are referred to according to 
their locations on the scaffolds (Fig.l, Tab. 1). We also 
found that scaffold 264 (16kb) and partial sequence of 
scaffold 226 (8 kb, containing genes 226-1 to 226-3) 
show a high similarity with sequences in scaffold 231. 
We can not rule out the possibility that they might 
represent different alleles of the same cluster, or they can 
result from recent cluster duplications. Taking into 
account also the possibility of miss-assembly of the 
sequences, the confirmation of the exact number of actin 


genes in amphioxus still awaits the improvement of the 


sequence data. Phylogenetic analysis (Fig.2) of the 27 
predicted full length actin proteins showed that they fell 
into three different groups: the muscle type, the 
cytoplasmic type and those that showed similar 
homology to both muscle and cytoplasmic types. In 
addition to the reported cytoplasmic actin (Kusakabe et 
al, 1997b, corresponding to gene 68), a cluster of 4 genes 
(on scaffold 207) was also clustered into the cytoplasmic 
type. They likely represent a new type of cytoplasmic 
actin. 
2.2 The structure and evolution of the actin genes 
The amphioxus muscle and cytoplasmic actin genes 
have been reported to contain 4 and 5 exons respectively. 
Indeed, in the genes with confirmed exon-intron 
structures, 14/31 contain 5 exons, 9/31 contain 4 exons. 
Interestingly, genes with different structures also exit. 
5/31 contain 6 exons, two of them have only two exons, 
and one gene even contains 7 exons (Tab. 2). Most of 
genes locate in the same scaffold have the same structure 


(Fig.l, Tab. 2), strongly suggesting that tandem 
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Fig. 1 The actin gene clusters in the genome of Branchiostoma floridae 


The genes are referred to according to their locations on the scaffolds. The scaffolds containing single actin genes (25, 68, 107, 121, 164 


and 227) were not shown here. The directions and gene structures were illustrated. The scaffold numbers are according to the second 


version of the genomic sequence of B. floridae (http://genome.jgi-psf.org/Brafl1). 
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Tab. 1 The predicted actin genes in the Branchiostoma floridae genome 


Hypothetical protein No. Scaffold 
Gene name Gene ID Exons Note 
BRAFLDRAFT _ (Bf_V2_) 
23-1 91591 7240061 4 23 Identical with 23-2 at 
23-2 91591 7240061 4 protein level. 
23-3 91592 7219456 4 
23-4 91593 7213556 4 
23-5 91594 7219447 4 
23-6 127488 7247450 4 
23-7 115566 7247449 4 
23-8 91599 7213557 5 Gap 
23-9 115345 7240054 4 
23-10 126328 7213558 4 
23-11 126329 7219458 5 
25 117851 7214412 2 25 
68 56687 7213170 5 68 
107 116948 7207692 6 107 
121 114976 7242166 6 121 
164 58093 7235147 5 164 Missing first 19aa' 
207-1 114535 7218086 6 207 
207-2 6? Gap 
207-3 104919 7218293 6 
207-4 104920 7219752 6 1 amino acid difference 
226-1 5? 226 Gaps 
226-2 86184 7243434 5 
226-3 Gaps 
226-4 2? Gaps 
226-5 117851 7214412 
227 201011 7216746 7 227 13 additional aa' 
231-1 115291 7219299 5 231 
231-2 5? Gaps 
231-3 Gaps 
231-4 275467 7213337 5 
231-5 5? Gaps 
231-6 126063 7211552 5 Fusion? 
231-7 126063 7211552 5 Fusion? 
231-8 5 Gaps 
264-1 5 264 Gaps 
264-2 88990 7246371 5 
264-3 125401 7246689 5 


Annotated hypothetical proteins and genes in the GenBank identical or close to our predicted genes were listed. 


1, The hypothetical proteins in the database contain less or more amino acids than our predicted version. 


2, The hypothetical protein 126063 is a fusion version of 231-6 and -7. 


duplication was the major means of expansion of the 
actin gene family in amphioxus. Since most of the intron 
positions (42, 123, 206, 270, Tab. 2) are conserved 
across species (Kusakabe et al, 1999), even in 
invertebrate actin genes, we prefer the model that the 
ancestor actin gene to be multi-exon. The actin genes 
contain only two exons might arise due to intron loss 
during evolution. Since intron gain and loss happen 
heavily during evolution, it is not easy to reconstruct the 
evolutionary history of the amphioxus actin genes 
according only to their structures. 


2.3 Expression of two muscle type actin genes in 


amphioxus (B. belcheri) 

In vertebrates, the muscle type actin genes were 
differentially expressed muscle types, e.g., the skeletal, 
smooth and cardiac muscles. Since multiple muscle actin 
genes were present in amphioxus, we cloned partial 
sequences of two different muscle type actin genes and 
compared their expression during amphioxus early 
development. The two actin genes showed similar 
expression in the somatic muscles in neurula and early 
larvae of amphioxus (Fig.3: A-C, F-H), but their 
expression showed slightly differences at later stages. 


The expression of 23-1 was much stronger than that of 


No. 5 WEI Yun-hu et al: Expansion of the Actin Gene Family in Amphioxus 477 


91) 23-1 
66 | 23-2 
84 IL 3.3 
zs 23-5 
97 23-4 
23-6 
100 
97 23-7 
121 
48 we 
62 | . 164 z 
Â, 
23-9 T 
— a 
56| joo 23-10 =. 
| Cardi 
ne Human 
27 99 Enteric muscle actin 
97 t Vascular 
231-1 
. 107 
89 
5] 
100 226-5 
gq! 25 
99 Beta Human "m 
a cytoplasmic actin 3 
65 81 5, 
68 i 
207-4 = 
63 e 
207-1 = 
32 81 = 
65 207-3 
264-3 
64 226-2 
100 231-6 
| 231-4 
| é 
24 231-7 
100 264-2 
= 23-11 
227 
0.05 
e—<=F>-_—"1 


Fig.2 Phylogenetic analysis of the amphioxus and human actin proteins 
The phylogenetic tree was constructed by the neighbor-joining method using the MEGA4 program. The predicted Branchiostoma floridae 
actins were referred to according to their locations on the scaffolds (Fig.1 and Tab.1). The muscle and cytoplasmic clusters of actin genes 
are indicated. The rest genes do not show a closer relationship to either cytoplasmic or muscle type actins. The accession numbers of the 
human actin protein sequences used were: B-cytoplasmic: M10277; y-cytoplasmic: NP 001605; a-skeletal: M20543; a-cardiac: J00073; 
a-vascular: BAA00492; y-enteric: X26940. 
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Tab. 2 Comparison of intron positions in amphioxus actin genes 


Genes 

42 64 
23-1-7, -9, -10 
23-8, -11; 68; 164; 226-2; 231-1-8; 264-1-3 + 
207-1-4 + 
25; 226-5 
107 F 
121 + 
227 F 


83 


Intron position 


114 123 154 169 206 270 309 
+ -+ + 
+ + + 
+ + + + 
+ 

+ -+ + + 

+ + + -+ 
-+ + -+ + 


Nomenclature for intron positions is according to the number of the corresponding amino acids in the protein 


(Bovenschulte & Weber, 1997). Note that due to slight length variations, the exact position of each intron in different 


actins might vary slightly from listed here. 


23-9 at larval stages. At early larva stage (24h), the 
expression of 23-1 became weak in the middle region of 
the body (Fig.3D) and the expression of 23-9 was only 
clear in the anterior part of the body (Fig.3I). At the 
45-hour larva stage, the expression of 23-1 was still 
strong in the muscle in the anterior trunk and the tail bud 
(Fig.3E) while the expression of 23-9 became very weak 
and could be detected only weakly in the pharyngeal 
region (Fig.3J). 


3 Discussion 


Actins are very well conserved at protein level during 
evolution. Here we showed that the amphioxus actin 
genes have clearly undergone rapid lineage specific 
expansion and contain more than 30 actin genes. Some 


of them fall clearly into the muscle or cytoplasmic types, 


while the rest show similar homology to both muscle and 
cytoplasmic actins. We also extracted the 1kb sequences 
upstream of the start codon and 300 bp 3' untranslated 
region (UTR) sequences of the actin genes and tried to 
construct phylogenetic trees using these sequences. 
However, the result showed that the 3' UTR sequences 
have become quite divergent and the tree was no longer 
reliable (data not shown). The 5' Ikb regions contain the 
5' UTR sequences and the potential promoter regions. 
Comparisons of these sequences revealed partial 
conservation of some of the sequences (data not shown). 
Detailed analysis of the potential promoter regions of 
these genes might help to resolve their transcriptional 
regulation mechanisms. It will be of interest to check 
systematically the expression patterns of the different 


actin genes to follow their potential functional 








E 
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Fig. 3 Embryonic and larval expression of the muscle actin genes 23-1 (A-E) and 23-9 (F-J) in Branchiostoma belcheri 


(A,B) Mid neurula (14h) with strong expression in the somatic muscle (arrowheads). (C,H) Late neurula (18h) with strong expression in 


the somatic muscle (arrowheads). (D, I) Early larva (24h) with expression in the muscle (arrowheads). Note that the expression of 23-1 


became weak in the middle region of the body (D) and the expression of 23-9 was only clear in the anterior part of the body (1). 


(E, J), 


45-hour larva. At this stage, the expression of 23-1 was still strong in the muscle in the anterior trunk and the tail bud (arrowheads in E 


and insert) while the expression of 23-9 became very weak and could be detected weakly in the pharyngeal region (arrowhead in J). (B), 


(G), dorsal views, anterior to the left. All other panels are lateral views, anterior to the left. Scale bar, 100 um. 
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divergences. In sea urchins, the actin gene family has 
been also reported to be rapidly evolving whose 
members are expressed in distinct cell lineages in a 


developmentally regulated fashion (Kissinger et al, 


1997) . Since they are usually clustered, the amphioxus 
actin gene family is also a good system to study gene 
evolution and the evolutionary relationships between 


available amphioxus species. It will be of interest to 
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